NSF PAR Search | NSF Public Access Repository

An adolescent and near-resonant planetary system near the end of photoevaporation

https://doi.org/10.1038/s41550-026-02795-9

Wang, Mu-Tian; Dai, Fei; Liu, Hui-Gen; Chen, Howard; Hu, Zhecheng; Petigura, Erik; Giacalone, Steven; Lee, Eve; Goldberg, Max; Leleu, Adrien; et al. (February 2026, Nature Astronomy)

Full Text Available

What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning

https://doi.org/10.18653/v1/2023.findings-acl.527

Pan, Jane; Gao, Tianyu; Chen, Howard; Chen, Danqi (July 2023, Association for Computational Linguistics)

Language Models as Science Tutors

Chevalier, Alexis; Geng, Jiayi; Wettig, Alexander; Chen, Howard; Mizera, Sebastian; Annala, Toni; Aragon, Max Jameson; Rodriguez Fanlo, Arturo; Frieder, Simon; Machado, Simon; et al. (May 2024, International Conference on Machine Learning)

Full Text Available

C-STS: Conditional Semantic Textual Similarity

https://doi.org/10.18653/v1/2023.emnlp-main.345

Deshpande, Ameet; Jimenez, Carlos; Chen, Howard; Murahari, Vishvak; Graf, Victoria; Rajpurohit, Tanmay; Kalyan, Ashwin; Chen, Danqi; Narasimhan, Karthik (January 2023, Computational Linguistics, Association for Computational Linguistics)

Full Text Available

WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Yao, Shunyu; Chen, Howard; Yang, John; Narasimhan, Karthik (January 2022, Advances in Neural Information Processing Systems)

Most existing benchmarks for grounding language in interactive environments either lack realistic linguistic elements, or prove difficult to scale up due to substantial human involvement in the collection of data or feedback signals. We develop WebShop – a simulated e-commerce website environment with 1.18 million real-world products and 12,087 crowd-sourced text instructions. In this environment, an agent needs to navigate multiple types of webpages and issue diverse actions to find, customize, and purchase a product given an instruction. WebShop provides several challenges including understanding compositional instructions, query (re-)formulation, dealing with noisy text in webpages, and performing strategic exploration. We collect over 1,600 human trajectories to first validate the benchmark, then train and evaluate a diverse range of agents using reinforcement learning, imitation learning, and pre-trained image and language models. Our best model achieves a task success rate of 29%, which significantly outperforms rule heuristics but is far lower than expert human performance (59%). We also analyze agent and human trajectories and ablate various model components to provide insights for developing future agents with stronger language understanding and decision making abilities. Finally, we show our agent trained on WebShop exhibits non-trivial sim-to-real transfer when evaluated on amazon.com and ebay.com, indicating the potential value of our benchmark for developing practical web agents that can operate in the wild.
Full Text Available

TOUCHDOWN: Natural Language Navigation and Spatial Reasoning in Visual Street Environments

https://doi.org/10.1109/CVPR.2019.01282

Chen, Howard; Suhr, Alane; Misra, Dipendra; Snavely, Noah; Artzi, Yoav (June 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))

Full Text Available
